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Hodge Laplacians on Graphs 

Lek-Heng Lim 


Abstract. This is an elementary introduction to the Hodge Laplacian on a 
graph, a higher-order generalization of the graph Laplacian. We will discuss 
basic properties including cohomology and Hodge theory. At the end we will 
also discuss the nonlinear Laplacian on a graph, a nonlinear generalization of 
the graph Laplacian as its name implies. These generalized Laplacians will be 
constructed out of coboundary operators, i.e., discrete analogues of exterior 
derivatives. The main feature of our approach is simplicity — this article 
requires only knowledge of linear algebra and graph theory. 


1. Introduction 

The primary goal of this article is to introduce readers to the Hodge Laplacian 
on a graph and discuss some of its properties, notably the Hodge decomposition. 
To understand its significance, it is inevitable that we will also have to discuss the 
basic ideas behind cohomology, but we will do so in a way that is as elementary as 
possible and with a view towards applications in the information sciences. 

If the classical Hodge theory on Riemannian manifolds I4TL 12 is “differen¬ 
tiable Hodge theory,” the Hodge theory on metric spaces mm “continuous Hodge 
theory,” and the Hodge theory on simplicial complexes [291 [31] “discrete Hodge 
theory,” then the version here may be considered “graph-theoretic Hodge theory.” 

Unlike physical problems arising from areas such as continuum mechanics or 
electromagnetics, where the differentiable Hodge-de Rlram theory has been applied 
with great efficacy for both modeling and computations Sl l30ll45ll53l l65l 1661170] . 
those arising from data analytic applications are likely to be far less structured 
[4UT5UT6U261I401I43U561I69]. Often one could at best assume some weak notion 
of proximity of data points. The Hodge theory introduced in this article requires 
nothing more than the data set having the structure of an undirected graph and 
is conceivably more suitable for non-physical applications such as those arising 
from the biological or information sciences. We will discuss two examples of such 
applications, to ranking and to game theory. 
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Readership. To some mathematical cognoscenti, cohomology is like calculus 
— an everyday tool used without a second thought. This article is not for them. It is 
intended for those who, like the author, have attempted to learn about cohomology 
from standard sources in algebraic topology but found them impenetrable and gave 
up. However, the author has had the good fortune to interact with some of those 
cognoscenti and did ultimately pick up bits and pieces through diffusion. One goal 
of this article is to share with a wider audience these bits and pieces that he believes 
make up the gist of the idea, which is remarkably simple. 

This article also serves as an expanded written record of the author’s lecture 
in the AMS Short Course on Geometry and Topology in Statistical Inference at the 
2014 Joint Mathematics Meetings. The target audience, as per AMS’s guidelines 
JXj , comprises: 

• the idly curious, knowing little or nothing specific of the field beyond a 
layman’s or graduate student’s familiarity; 

• peripheralists, who have read a few articles, perhaps dabbled once or twice 
in the field, and would like to have a perspective of the held presented on 
a silver platter; 

• young specialists and prospective teachers, who want to make sure they 
see the forest for the trees, and haven’t missed something significant. 

As such, the target readership of this article is one that would most likely appreciate 
our simple take on cohomology and Hodge theory, via an approach that requires 
only linear algebra and graph theory. 

In our approach, we have isolated the algebra from the topology to show that a 
large part of cohomology and Hodge theory is nothing more than the linear algebra 
of matrices satisfying AB = 0. For the remaining topological aspect, we cast our 
discussions entirely in terms of graphs as opposed to less-familiar topological objects 
like simplicial complexes. We believe that by putting these in a simple framework, 
we could facilitate the development of applications as well as communication with 
practitioners who may not otherwise see the utility of these notions. 

Lastly, we write with a view towards readers whose main interests, like the 
author’s, may he in machine learning, matrix computations, numerical PDEs, opti¬ 
mization, statistics, or theory of computing, but have a casual interest in the topic 
and may perhaps want to explore potential applications in their respective fields. 

The occasional whimsical section headings are inspired by [rni50H5Tll59H60j. 

2. Cohomology and Hodge theory for pedestrians 

We will present in this section what we hope is the world’s most elementary 
approach towards cohomology and Hodge theory, requiring only linear algebra. 

2.1. Cohomology on a bumper sticker. Given two matrices A £ R mxn 
and B £ R nxp satisfying the property that 

(2.1) AB = 0, 

a property equivalent to 

im(R) C ker(A), 

the cohomology group with respect to A and B is the quotient vector space 

ker(A)/ im(H), 
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and its elements are called cohomology classes. The word ‘group’ here refers to the 
structure of ker (A)/im(-B) as an abelian group under addition. 

We have fudged a bit because we haven’t yet defined the matrices A and B. 
Cohomology really refers to a special case when A and B are certain matrices with 
topological meaning, as we will define in Section [3] 


2.2. Harmonic representative. The definition in the previous section is 
plenty simple, provided the reader knows what a quotient vector space is, but 
can it be further simplified? For instance, can we do away with quotient spaces and 
equivalence classed] and define cohomology classes as actual vectors in R ra ? 

Note that an element in ker {A)/ im(B) is a set of vectors 

x + im (B) := {x + y £ R" : y £ im(B)} 

for some x £ ker (A). We may avoid equivalence classes if we could choose an 
Xh £ x + im(B) in some unique way to represent the entire set. A standard 
way to do this is to pick xh so that it is orthogonal to every other vector in 
im(B). Since im(i?) J - = ker(B*), this is equivalent to requiring that xh £ ker (B*). 
Hence we should pick an xh £ ker(A) D ker(H*). Such an xh is called a harmonic 
representative of the cohomology class x + irn(H). 

The map that takes the cohomology class x + im (B) to its unique harmonic 
representative Xh gives a natural isomorphism of vector spaces 

(2.2) ker(A)/im(H) = ker(A) n ker(U*). 


So we may redefine the cohomology group with respect to A and B to be the 
subspace ker(A) n ker(H*) of R", and a cohomology class may now be regarded as 
an actual vector xh £ ker(A) nker(J3*) (cf. Theorem A.31. 

A word about our notation: B* denotes the adjoint of the matrix B. Usually we 
will work over R with the standard Z 2 -inner product on our spaces and so B* = B J . 
However we would like to allow for the possibility of working over C or with other 
inner products. 

Numerical linear algebra interlude. For those familiar with numerical linear 
algebra, the way we choose a unique harmonic representative xh to represent a 
cohomology class x + irn(H) is similar to how we would impose uniqueness on a 
solution to a linear system of equations by requiring that it has minimum norm 
among all solutions [34, Section 5.5]. More specifically, the solutions to Ax = b 
are given by Xq + ker(A) where Xq is any particular solution; we impose uniqueness 
by requiring that :ro £ ker(A) _L = im(A*), which gives the minimum norm (or 
pseudoinverse) solution Xq = A^b. The only difference above is that we deal with 
two matrices A and B instead of a single matrix A. 


2.3. Hodge theory on one foot. We now explain why an element in ker( A )D 
ker(H*) is called ‘harmonic’. Again assume that AB = 0, the Hodge Laplacian is 
the matrix 


(2.3) A*A + BB* £ R nxn . 

It is an easy exercise (cf. Theorem A.21 to show that 

(2.4) ker (A* A + BB*) = ker(A) n ker(H*). 


^Practitioners tend to dislike working with equivalence classes of objects. One reason is that 
these are often tricky to implement in a computer program. 
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So the harmonic representative xh that we constructed in Section [2T2| is a solution 
to the Laplace equation 

(2.5) (A*A + BB*)x = 0. 


Since solutions to the Laplace equation are called harmonic functions, this explains 
the name ‘harmonic’ representative. 

With this observation, we see that we could also have defined the cohomology 
group (with respect to A and B) as the kernel of the Hodge Laplacian since 


ker(.A)/ irn(H) = ker (A* A + BB*). 


It is also an easy exercise (cf. Theorem A.2) to show that one has a Hodge decom¬ 
position , an orthogonal direct sum decomposition 


(2.6) R n = im (A*) © ker(A*A + BB*) © im (B). 


In other words, whenever AB = 0, every x £ R™ can be decomposed uniquely as 
x = A* w + xh + Bv , ( A* w , xh) = {xh, Bv) = ( A* w , Bv) = 0, 

for some v £ R p and w £ R m . 

Recall the well-known decompositions (sometimes called the Fredholm alterna¬ 
tive) associated with the four fundamental subspaces (641 of a matrix A £ R mx ", 


(2.7) 


= ker(A) © im(A*), 


= ker(A*) © im(A). 


The Hodge decomposition (2.6) may be viewed as an analogue of (2.71 for a pair 
of matrices satisfying AB = 0. In fact, combining (2.6) and (2.7), we obtain 

ker(B*) 

/ - A -N 

(2.8) R™ = im(A*) © ker(A*A + BB*) © im(B). 

s -v-' 

ker(A) 


The intersection of ker(A) and ker(H*) gives ker(A*A + BB*), confirming (2.4). 
Since A* A + BB* is Hermitian, it also follows that 


(2.9) 


im (A* A + BB*) = im(A*) © iin(H). 


For the special case when A is an arbitrary matrix and B = 0, the Hodge 


decomposition (2.6) becomes 


( 2 . 10 ) 


= im(A) © ker {A* A), 


which may also be deduced directly from the Fredholm alternative (2.7) since 
(2.11) ker {A* A) = ker (A). 

Numerical linear algebra interlude. To paint an analogy like that in the last 
paragraph of Section [2T2| o ur characterization of cohomology classes as solutions to 
the Laplace equation (2.5) is similar to the characterization of solutions to a least 


squares problem min^gRn || Ax — &|| as solutions to its normal equation A* Ax = A*b 
j34j Section 6.3]. Again the only difference is that here we deal with two matrices 
instead of just one. 














HODGE LAPLACIANS ON GRAPHS 


5 


2.4. Terminologies. One obstacle that the (impatient) beginner often faced 
when learning cohomology is the considerable number of scary-sounding terminolo¬ 
gies that we have by-and-large avoided in the treatment above. 

In Table |T 
Sections 12. 1[ |2.2 


we summarize some commonly used terminologies for objects in 
and 2.3 Their precise meanings will be given in Sections [3] and 
[4j with an updated version of this table appearing as Table [3] As the reader can 


Table 1. Topological jargons (first pass) 


NAME 

MEANING 

coboundary maps 

A e M mx ", B e 

cochains 

elements of R n 

cochain complex 

r A r A R m 

cocycles 

elements of ker(A) 

coboundaries 

elements of im(£?) 

cohomology classes 

elements of ker (A)/im(I3) 

harmonic cochains 

elements of ker(A*A + BB*) 

Betti numbers 

dim ker (A*A + BB*) 

Hodge Laplacians 

A* A + BB* e R nx ” 

x is closed 

Ax = 0 

x is exact 

x = Bv for some v £ R p 

x is coclosed 

B*x = 0 

x is coexact 

x = A* w for some w € R m 

x is harmonic 

(A*A + BB*)x = 0 


see, there is some amount of redundancy in these terminologies; e.g., saying that a 
cochain is exact is the same as saying that it is a coboundary. This can sometimes 
add to the confusion for a beginner. It is easiest to just remember equations and 
disregard jargons. When people say things like ‘a cochain is harmonic if and only 
if it is closed and coclosed,’ they are just verbalizing (2.4). 

In summary, we saw three different ways of defining cohomology: If A and B 
are matrices satisfying AB = 0, then the cohomology group with respect to A and 
B may be taken to be any one of the following, 


( 2 . 12 ) 


ker(A)/im(S), ker(A) PI ker(S*), ker (A*A + BB*). 


For readers who have heard of the term homology , that can be defined just by 
taking adjoints. Note that if AB = 0, then B* A* = 0 and we may let B* and A* 
play the role of A and B respectively. The homology group with respect to A and 
B may then be taken to be any one of the following, 


(2.13) 


ker(i?*)/im(A*), ker(.B*) n ker (A), ker (BB*+A*A). 


As we can see, the last two spaces in (2. 12|) and (2.13) are identical, i.e., there is no 


difference between cohomology and homology in our context (but see Section 5.1 
for caveats). 
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3. Coboundary operators and Hodge Laplacians on graphs 

Observe that everything in Section[2]relies only on the assumption that AB = 0. 
We haven’t been entirely forthright: What we have discussed up to this point is 
really the linear algebra of operators satisfying AB = 0 - the ‘algebraic half’ of 
of cohomology and Hodge theory. There is also a ‘topological half’ that is just 
one step away — they refer to the special case when A and B are the so-called 
coboundary operators. This special case allows us to attach topological meanings 
to the objects in Section [2] 

Just like the last section requires nothing more than elementary linear algebra, 
this section requires nothing more than elementary graph theory. We will discuss 
simplicial complexes (family of subsets of vertices), cochains (functions on a graph), 
and coboundary operators (operators on functions on a graph) — all in the context 
of the simplest type of graphs: undirected, unweighted, no loops, no multiple edges. 

3.1. Graphs. Let G = ( V,E) be an undirected graph where V := {1,... ,n} 
is a finite set of vertices and EC ( l 9 ) is the selnof edges. Note that once we have 
specified G, we automatically get cliques of higher order — for example, the set of 
triangles or 3-cliques T C (^) is defined by 

{i,j,k}cT iff {i,j},{i,k},{j,k} e E. 

More generally the set of k-cliques Kk(G) C is defined by 

{iii ■ • • ,h} G K k (G) iff {i P ,iq} € E for all 1 < p < q < k, 

i.e., all pairs of vertices in {i\,... ,ik} are in E. 

In particular we have 

K\ ( G ) = V, K 2 (G ) = E, K 3 (G) = T. 

Note that once we have specified V and E, all Kk(G) for k > 3 would have been 
uniquely determined. 

In topological parlance, a nonempty family K of finite subsets of a set V is 
called a simplicial complex (more accurately, an abstract simplicial complex) if for 
any set S in K, every S' C S also belongs to K. Evidently the set comprising all 
cliques of a graph G, 

i i w(G) 

K{G) := |J fc=1 ^ fc (G), 

is a simplicial complex and is called the clique complex of the graph G. The clique 
number ui(G) is the number of vertices in a largest clique of G. 

There are abstract simplicial complexes that are not clique complexes of graphs. 
For example, we may just exclude cliques of larger sizes - UfcLi -Kfc(G) is still an 
abstract simplicial complex for any m = 3,..., w(G) — 1, but it would not in general 
be a clique complex of a graph. 

3.2. Functions on a graph. Given a graph G = (V,E), we may define real¬ 
valued functions on its vertices / : V —>• M. We may also define real-valued functions 
on E and T and A'fc(G) in general but we shall require them to be alternating. By 
an alternating function on E, we mean one of the form X : V x V —> K. where 

X(i,j) = —X(j, i) 

2 Henceforth (^) denotes the set of all ^-element subsets of V. In particular E is not a 
multiset since our graphs have no loops nor multiple edges. 
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for all {i,j} £ E, and 

X(i,j)= 0 

for all {*, j} qL E. An alternating function on T is one of the form $ : V x V x V —>■ K. 
where 


$(i, j,fc) = ®(j,k,i) = = -$(k,j,i) 


for all {i,j, k} £ T, and 

k)=0 

for all {i,j, k} (jL T. More generally, an alternating function is one where permuta¬ 
tion of its arguments has the effect of multiplying by the sign of the permutation, 
as we will see in (4.1). 

In topological parlance, the functions /, X, $ are called 0-, 1-, 2-cochains. These 
are discrete analogues of differential forms on manifolds m ■ Those who prefer to 
view them as such often refer to cochains as discrete differential forms [ 271155 ] or 
Whitney forms [ 121168 ] and in which case, /, X , $ are 0-, 1-, 2-forms on G. 

Observe that a 1-cochain X is completely specified by the values it takes on 
the set {(i,j) : i < j} and a 2-cochain $ is completely specified by the values it 
takes on the set k) : i < j < k}. We may equip the spaces of cochains with 
inner products, for example, as weighted sums 


(3.1) 


(f,g)v = 


X , (X,Y) E WijX(i,j)Y(i,j), 

L '2—1 z — i<3 

($,T)t = w ij k $(i,j,k)^(i,j,k), 

z — 'i<3<k 


where the weights Wi,Wij,Wijk are given by any positive values invariant under 
arbitrary permutation of indices. When they take the constant value 1, we call it 
the standard l 2 -inner product. By summing only over the setf^{(?', j) : i < j} and 
{( i,j,k ) : i < j < fc}, we count each edge or triangle exactly once in the inner 
products. 

We will denote the Hilbert spaces of 0-, 1-, and 2-cochains as L 2 (V), L\(E), 
L\(T) respectively. The subscript A is intended to indicate ‘alternating’. Note that 
L 2 f\(V) = L 2 (V) since for a function of one argument, being alternating is a vacuous 
property. We set Lf(0) := {0} by convention. 

The elements of Lf(E) (i.e., 1-cochains) are well-known in graph theory, of¬ 
ten called edge flows. While the graphs in this article are always undirected and 
unweighted, a directed graph is simply one equipped with a choice of edge flow 
X £ L\(E) — an undirected edge {i,j} £ E becomes a directed edge (i,j) if 
X(i,j ) > 0 or (j,i) if X{i,j) < 0; and the magnitude of X(i,j ) may be taken as 
the weight of that directed edge. So Lf(E) encodes all weighted directed graphs 
that have the same underlying undirected graph structure. 


3.3. Operators on functions on a graph. We will consider the graph- 
theoretic analogues of grad, curl, div in multivariate calculus. The gradient is the 
linear operator grad : L 2 (V ) -* L\(E) defined by 

(grad = f(j) - /(*) 


^Our choice is arbitrary; any set that includes each edge or triangle exactly once would also 
serve the purpose. Each such choice corresponds to a choice of orientation on elements of E or T. 
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for all {?', j } £ E and zero otherwise. The curl is the linear operator curl : L\ (E) —> 
L 2 a {T) defined by 

(cwd X)(i,j,k) = X(i,j) + X(j,k) + X(k,i) 

for all {i,j,k} £ T and zero otherwise. The divergence is the linear operator 
div : L A (E) —> L 2 (V) defined by 

n 

(div X) (i) = Y xr X (^j) 

3 = 1 

for all i £ V. 

Using these, we may construct other linear operators, notably the well-known 
graph Laplacian , the operator A 0 : L 2 (V) L 2 (V) defined by 


Aq = — div grad, 


which is a graph-theoretic analogue of the Laplace operator (cf. Lemma B.3). Less 
well-known is the graph Helmholtzian , the operator Ai : L 2 A (E) —» L 2 A (E) defined 

by 

Ai = — grad div + curl* curl, 


which is a graph-theoretic analogue of the Helmholtz or vector Laplace operator. 
It is straightforward to derive an expression for the adjoint of the curl operator, 
curl* : L 2 a (T) -> L 2 A (E) is given by 


(curl* $)(*, j) = Y 

z ' m- • 


k=l 


for all {i, j} £ E and zero otherwise (cf. Lemma B.2). 


The gradient and curl are special cases of coboundary operators , discrete ana¬ 
logues of exterior derivatives , while the graph Laplacian and Helmholtzian are spe¬ 
cial cases of Hodge Laplacians. 

The matrices A and B that we left unspecified in Section [2] are coboundary 
operators. It is easy to see that the composition 


(3.2) 


curl grad = 0 


and so setting A = curl and B = grad gives us (2.1) (cf. Theorem B.4). 


Note that divergence and gradient are negative adjoints of each other: 
(3.3) div = — grad*, 


(cf. Lemma B.l). With this we get Ai = A*A + BB* as in (2.3). 


If the inner products on L 2 (V) and L 2 A (E ) are taken to be the standard Z 2 -inner 
products, then (3.3) gives A 0 = B*B = B J B , a well-known expression of the graph 


Laplacian in terms of vertex-edge incidence matrix B. The operators 


grad* grad : Lr{V) —>• L 2 (V) and curl* curl : L A (E) —► L A (E) 


are sometimes called the vertex Laplacian and edge Laplacian respectively. The 
vertex Laplacian is of course just the usual graph Laplacian but note that the edge 
Laplacian is not the same as the graph Helmholtzian. 
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Physics interlude. Take the standard / 2 -inner products on L 2 {V) and L\{E), 
the divergence of an edge flow at a vertex i £ V may be interpreted as the netflow , 


(3.4) (divX)(*) = (inflow V)(i) — (outflow X){i), 

where inflow and outflow are defined respectively for any X £ Lf{E) and any 
i £V as 


(inflow X){i) X(i,j), (outflow X){i) 


E 






Sometimes the terms incoming flux , outgoing flux, total flux are used instead of 
inflow, outflow, net flow. Figure [2] shows two divergence-free edge flows, i.e., inflow 
equals outflow at every vertex. 

Let X £ Lf(E). A vertex i € V is called a sink of X if X(i,j) > 0 for 
every neighbor {i,j} £ E of i. Likewise a vertex i £ V is called a source of X if 
X{i,j) < 0 for every neighbor {i,j} £ E of i. In general, an edge flow may not 
have any source or sinlQbut if it can be written as 


(3.5) 


X = - grad / 


for some / £ L 2 {V), often called a potential function, then X will have the property 
of flowing from sources (local maxima of /) to sinks (local minima of /). See 
Figure |8b| for an example of an edge flow given by the gradient of a potential 
function, flowing from two sources into two sinks. 


Example 3.1. We highlight a common pitfall regarding curl on a graph. Con¬ 
sider C 3 and C 4 , the cycle graphs on three and four vertices in Figure [lj 



Figure 1. Cycle graphs C 3 {left) and C 4 {right). 

Number the vertices and consider the edge flows in Figure [2] What are the 
values of their curl? For the one on C 3 , the answer is2 + 2 + 2 = 6 as expected. 
But the answer for the one on C 4 is not 2 + 2 + 2 + 2 = 8 , it is in fact 0. 

(*> - 2 -© 

2 2 

®- 2 -■© 

Figure 2. Edge flows on C 3 (left) and C 4 (right). 

^There is an alternative convention that defines i £ V to be a source (resp. sink) of X as 
long as div X (z) > 0 (resp. div X (i) < 0) but our definition is much more restrictive. 
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The second answer may not agree with a physicist’s intuitive idea of curl and 
is a departure from what one would expect in the continuous case. However it is 
what follows from the definition. Let X G L\{E{C 3 )) denote the edge flow on C 3 
in Figure [2] It is given by 

X{1, 2) = X(2, 3) = X(3,1) = 2 = -X(2, 1) = -X(3,2) = -X(l, 3), 

and the curl evaluated at {1,2,3} G T(C 3 ) is by definition indeed 

(curl X)(l, 2,3) = X(l,2) + X(2,3) +A*(3,1) = 6 . 

On the other hand C 4 has no 3-cliques and so TiC^) = 0 . By convention L\(0) = 
{0}. Hence curl : L\(E(C 4 )) —> L 2 ^(T[Ci)) must have curlA = 0 for all X G 
L\(E(Ca)) and in particular for the edge flow on the right of Figure [ 2 ] 


Table 2. Electrodynamics/fluid dynamics jargons 


NAME 

MEANING 

ALTERNATE NAME(s) 

divergence-free 

element of ker(div) 

solenoidal 

curl-free 

element of ker(curl) 

irrotational 

vorticity 

element of im(curl*) 

vector potential 

conservative 

element of im(grad) 

potential flow 

harmonic 

element of ker(Ai) 


anharmonic 

element of im(Ai) 


scalar field 

element of L 2 (V) 

scalar potential 

vector field 

element of L^(E) 



3.4. Helmholtz decomposition for graphs. The graph Laplacian Ao : 
L 2 {V)^L 2 (V), 

A 0 = — div grad = grad* grad, 

particularly under the context of spectral graph theory [2X1163]. has been an enor¬ 
mously useful construct, with great impact on many areas. We have nothing more 
to add except to remark that the Hodge decomposition of A 0 is given by (2.10), 

L 2 (V) = ker(A 0 ) ® im(div). 


Recall from (2.11) that ker(Ao) = ker(grad). Since grad/ = 0 iff / is piecewise 
constant, i.e., constant on each connected component of G, the number /3q(G) = 
dimker(Ao) counts the number of connected component of G — a well known fact 
in graph theory. 

The Hodge decomposition for the graph Helmholtzian Ai : L^(E) —> L^(E), 
Ai = — grad div + curl* curl = grad grad* + curl* curl. 


is called the Helmholtz decomposition. It says that the space of edge flows admits 
an orthogonal decomposition into subspaces 

ker(div) 
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and moreover the three subspaces are related via 

(3.7) ker(Ai) = ker(curl) n ker(div), im(Ai) = im(curl*) ® im(grad). 


In particular, the first equation is a discrete analogue of the statement “a vector 
field is curl-free and divergence-free if and only if it is a harmonic vector field.” 

There is nothing really special here — as we saw in Sectio n |2.3[ any matrices 
A and B satisfying AB = 0 would give such a decomposition: (|3.6|) and (3.71 are 


indeed just ( 2.6 1 , (2.4), and (2.9) where A = curl and B = grad. This is however a 
case that yields the most interesting applications (see Section [ 6 ] and [15114311. 

Example 3.2 (Beautiful mind problem on graphs). This is a discrete analogue 
of a probleir0 that appeared in a blockbuster movie: Let G = (V, E) be a graph. 
If A £ L? A {E) is curl-free, then is it true that A is a gradient? In other words, if 
X £ ker(curl), must it also be in im(grad)? Clearly the converse always holds by 


(3.2) but from (3.6), we know that 


(3.8) 


ker(curl) = ker(Ai) ® im(grad) 


and so it is not surprising that the answer is generally no. We would like to describe 
a family of graphs for which the answer is yes. 



Figure 3. Problem from A Beautiful Mind: V = {F : R 3 \ A —)• 
R 3 so V x F = 0}, W = {F = Xg}, dim (V/W) = ? 


The edge flow A' £ L\(E{C±)) on the right of Figure I is an example of one 
that is curl-free but not a gradient. It is trivially curl-free since T{C/f) = 0 . It is 
not a gradient since if X = grad /, then 

/(2) — /(!) = 2, /(3)-/(2)=2, /(4)-/(3) = 2, /(l) - /(4) = 2, 


and summing them gives ‘0 = 8 ’ — a contradiction. Note that A is also divergence- 
free by ( |3.4| ) since inflow A = outflow A. It is therefore harmonic by (3.71, i.e., 
A £ ker(A-|) as expected. 


^Due to Dave Bayer im now a ubiquitous homework problem in many multivariate calculus 
courses thanks to him and the movie. See Figure [S] 
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Every divergence-free edge flow on 64 must be of the same form as X, taking 
constant value on all edges or otherwise we would not have inflow X = outflow X. 
Since all edge flows on 64 are automatically curl-free, ker(Ai) = ker(div) and is 
given by the set of all constant multiples of X. The number 

Pi (G) = dimker(Ai) 

counts the number of ‘ 1 -dimensional holes’ of G and in this case we see that indeed 
Pi (64) = 1 . To be a bit more precise, the ‘ 1 -dimensional holes’ are the regions that 
remain uncovered after the cliques are filled in. 

We now turn our attention to the contrasting case of 63. Looking at Figure [l] 
it may seem that 63 also has a ‘1-dimensional hole’ as in 64 but this is a fallacy 
holes bounded by triangles are not regarded as holes in our framework. 

For C 3 it is in fact true that every curl-free edge flow is a gradient. To see 
this, note that as in the case of 64, any divergence-free X £ L 2 ^(E{C 3 )) must be 
constant on all edges and so 

(curl A)(l, 2 , 3 ) = A'(l, 2 ) +X( 2 , 3 ) + X( 3 , 1 ) = c + c + c = 3 c, 


for some c £ M. If a divergence-free X is also curl-free, then c = 0 and so X = 0 . 
Hence for 63, kei^Ai) = { 0 } by ( 3 . 8 ) and ker(curl) = im(grad) by ( 3 . 7 ). It also 
follows that P\(C 3 ) = 0 and so 63 has no ‘ 1 -dimensional hole’. 

What we have illustrated with 63 and 64 extends to any arbitrary graph. A 
moment’s thought would reveal that the property Pi{G) = 0 is satisfied by any 
chordal graph , i.e., one for which every cycle subgraph of four or more vertices 
has a chord , an edge that connects two vertices of the cycle subgraph but that is 
not part of the cycle subgraph. Equivalently, a chordal graph is one where every 
chordless cycle subgraph is 63. 






Figure 4. Left two graphs: not chordal. Right two graphs: chordal. 

There are however non-chordal graphs with Pi{G) = 0 . Take for example the 
wheel graph W n , formed by joining all n — 1 vertices of the cycle graph C n -\ to an 
extraneous nth vertex, n > 4. It is clear that P\{W n ) = 0 for all n > 4 but that 
only IV 4 is a chordal graph. For the wheel graphs W 3 , He■ W"V in Figure [ 5 j their 
cycle subgraphs 64,65, C 6 are evidently all chordless. 



Figure 5. Wheel graphs W 4 , W 5 , Wq, W 7 (left to right). 
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4. Higher order 

We expect the case of alternating functions on edges, i.e., fc = 1, discussed in 
Section[3]to be the most useful in applications. However for completeness and since 
it is no more difficult to generalize to k > 1, we provide the analogue of Section [3] 
for arbitrary fc here. 


4.1. Higher-order cochains. Let K(G) be the clique complex of a graph 


G = (V,E) as defined in Section 3.1 We will write K k = K k (G) for simplicity. 


/ 
(4.1) 


A k-cochain (or fc-form) is an alternating function on K k +i, or more specifically, 
V x • • • x V —> I where 


/(Mo), • • •, Mfc)) = sgn(cr)/(i 0 ,..., ik) 


for all {*o, • ■ •, ik} G Kk +1 and all cr € & k + i, the symmetric group of permutations 
on {0,.. ,,k}. We set f(i 0 ,... ,4) = 0 if {4, ■ • • ,4} ft K k+1 . 

Again, we may put an inner product on fc-cochains, 


z — / »o< -<Jfc 

with any positive weights satisfying Wi ff(0) ...i„ (fc) = w io ... ik for all cr e 6 fc+ i. 

We denote the resulting Hilbert space by L A (K k + 1 ). This is a subspace of 
L 2 (f\ k+1 V), the space of alternating functions with k + 1 arguments in V. Clearly, 

dim L\(Kk+i) = #K k+ 1 - 


A word of caution regarding the terminology: a k -cochain is a function on a 
(fc + l)-clique and has fc + 1 arguments. The reason is due to the different naming 
conventions — a (fc + l)-clique in graph theory is called a fc-simplex in topology. In 
topological lingo, a vertex is a 0 -simplex, an edge a 1 -simplex, a triangle a 2 -simplex, 
a tetrahedron a 3-simplex. 


4.2. Higher-order coboundary operators. The k-coboundary operators 5 k : 
L\[K k ) -A L 2 A (K k+ 1 ) are defined by 

fc +1 

(4.2) (Skf)(io, ■ ■ •,4+i) = ’ 4 - 1 ’ 4 +i> ■ ■ • ’ *fe+i)> 

3=0 

for fc = 0,1, 2,.... Readers familiar with differential forms may find it illuminating 
to think of coboundary operators as discrete analogues of exterior derivatives. Note 
that / is a function with fc + 1 arguments but S k f is a function with fc + 2 arguments. 
A convenient oft-used notation is to put a carat over the omitted argument 

(4.3) f(i 0 ,...,ij,...,ik+i) • 

The crucial relation AB = 0 in Section [2] is in fact 


(4.4) 


44-i = o, 


which is straightforward to verify using (4.2 1 (cf. Theorem B.4|. This generalizes 
(3.2) and is sometimes called the fundamental theorem of topology: 


The boundary of a boundary is empty. 


The reader is encouraged to envision any common geometric objects and see this 


for himself. The message expressed by ( 

4.4 

) is really the dual statement 


The coboundary of a coboundary is zero. 
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As in Section 2.1 (4.4) is equivalent to saying im(4-i) is a subspace of ker(4)- 
We define the fcth cohomology group of G to be the quotient vector space 

17 fc (G)=ker(4)/im(4-i), 


for k = 1,2,..., w(G) — 1. 

To keep track of the coboundary operators, it is customary to assemble them 
into a sequence of maps written in the form 


Ll 


(Ko) 


So, 


Ll 


(Ki 


Ll(K k ) ^ L 2 A (K k+ 1 ) 




Ll 




This sequence is called a cochain complex. 

For k = 1, we get 4 = grad, 4 = curl, and the first two terms of the cochain 
complex are 

L 2 (V) Ll (E) ^ L 2 a (T). 


4.3. Hodge theory. The Hodge k-Laplacian A k : Ll(K k ) —> L\{K k ) is de¬ 
fined as 

A fc = 6 k -i5l_ 1 + 5* k 8 k . 

We call / £ Ll(K k ) a harmonic k-cochain if it satisfies the Laplace equation 

A fc / = 0. 

Applying the results in Section [273] with A = 5 k and B = 8 k _ i, we obtain the 
unique representation of cohomology classes as harmonic cochains 

H k (G ) = ker(4)/im(4_!) ^ ker(4) n ker(4_ 1 ) = ker(A fc ), 

as well as the Hodge decomposition 

k e r(4 —i) 

L\{K k ) = im(4) © ker(A fc ) © im(4_i), 

S ---'v'- 

ker((5fe) 

and the relation 

im(Ai) = im(4) © im(4-i)- 


Example 4.1 (Graph isomorphism). Two undirected graphs G and H on n 
vertices are said to be isomorphic if they are essentially the same graph up to rela¬ 
beling of vertices. The graph isomorphism problem, an operj^] problem in computer 
science, asks whether there is a polynomial-time algorithm for deciding if two given 
graphs are isomorphic [5) ■ Clearly two isomorphic graphs must be isospectral in the 
sense that the eigenvalues (ordered and counted with multiplicities) of their graph 
Laplacians are equal, 

4(A 0 (G)) = 4(A 0 (IT)), * = 1,..., n, 

a condition that can be checked in polynomial time. Not surprisingly, the converse is 
not true, or we would have been able to determine graph isomorphism in polynomial 
time. We should mention that there are several definitions of isospectral graphs, 
in terms of the adjacency matrix, graph Laplacian, normalized Laplacian, signless 
Laplacian, etc. See [14H36| for many interesting examples. 


^Perhaps not for much longer — a significant breakthrough has been announced [6] as this 
article is going to press. 
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The reader may perhaps wonder what happens if we impose the stronger re¬ 
quirement that the eigenvalues of all their higher-order Hodge fc-Laplacians be equal 
as well? 

Aj(A fc (G)) = A*(A k(H)), i = l,...,n, k = 0,...,m. 

For any m > 1, these indeed give a stronger set of sufficient conditions that can be 
checked in polynomial time. For example, the eigenvalues of Ao for the two graphs 
in Figure [d] are 0,0.76,2,3,3,5.24 (all numbers rounded to two decimal figures). 
On the other hand, the eigenvalues of Ai are 0,0.76, 2, 3,3,3, 5.24 for the graph on 
the left and 0, 0, 0.76, 2,3, 3, 5.24 for the graph on the right, allowing us to conclude 
that they are not isomorphic. These calculations are included in Section 0 



Figure 6 . These graphs have isospectral Laplacians (Hodge 0- 
Laplacians) but not Helmholtzians (Hodge 1-Laplacians). 


Non-isomorphic graphs can nevertheless have isospectral Hodge Laplacians of 
all order. The two graphs in Figure [7] are clearly non-isomorphic. Neither contains 
cliques of order higher than two, so their Hodge /c-Laplacians are zero for all k > 2. 
It is straightforward to check that the first three Hodge Laplacians Ao, Ai, A 2 , of 
both graphs are isospectral (cf. Section 0- 



Figure 7. Non-isormorphic graphs with isospectral Hodge k- 
Laplacians for all k = 0,1,2,.... 


5. Further information 

5.1. Topological caveats. The way we defined cohomology in Section [2T] is 
more or less standard. The only simplification is that we had worked over a field. 
The notion of cohomology in topology works more generally over arbitrary rings 
where this simple linear algebraic approach falls short, but not by much — all we 
need is to be willing to work with modules over rings instead of modules over fields, 
i.e., vector spaces. Unlike a vector space, a module may not have a basis and we 
may not necessarily be able to represent linear maps by matrices, a relatively small 
price to pay. 
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Table 3. Topological jargons (second pass) 


NAME 

MEANING 

coboundary maps 

&k '■ (Kk) -A- L^(Kk+i) 

cochains 

elements of L^(I\ k ) 

cochain complex 

-» Ll(K k _ i) ^ Ll(K k ) ^ L*(K k+1 ) —> • • • 

cocycles 

elements of ker(d*.) 

coboundaries 

elements of im(<5fc_i) 

cohomology classes 

elements of ker { 8 k)/ im(Jfc-i) 

harmonic cochains 

elements of ker(Afc) 

Betti numbers 

dimker(Afc) 

Hodge Laplacians 

A k — 5k-l^k-i + 5 k 5k 

/ is closed 

?r 

II 

O 

/ is exact 

/ = 4-iff for some g G L 2 A (K k -i) 

/ is coclosed 

* 

1 

II 

O 

/ is co exact 

/ = Skh for some h G L\{K k + 1 ) 

/ is harmonic 

A k f = 0 


However the further simplifications in Sections 2.2 and |2.3| to avoid quotient 
spaces only hold when we have a field of characteristic zero (we chose R). For 
example, if instead of R, we had the field F 2 of two elements with binary arithmetic 
(or indeed any field of positive characteristic), then we can no longer define inner 
products and statements like ker(H) ± = im(B*) make no sense. While the adjoint 
of a matrix may still be defined without reference to an inner product, statements 
like ker(A*A) = ker(A) are manifestly false in positive characteristic. 

We saw in Section [2T| that in the way we presented things, there is no difference 
between cohomology and homology. This is an artifact of working over a field. In 
general cohomology and homology are different and are related via the universal 
coefficient theorem m 

From the perspective of topology, the need to restrict to fields of zero charac¬ 
teristic like R and C is a big shortcoming. For example, one would no longer be 
able to ‘detect torsion’ and thereby perform basic topological tasks like distinguish¬ 
ing between a torus and a Klein bottle, which is a standard utility of cohomology 
groups over rings or fields of positive characteristics. 


Another technical reason against the Hodge-theoretic approach in Sections 2.2 
and |2.3| is that it is not functorial. 


However, if one is primarily interested in 
engineering and scientific applications, then it is our belief that our approach in 
Sections ii and [4] is adequate. Despite our restriction to clique complexes of 
graphs, our discussions in Section [4] apply verbatim to any simplicial complex. 

We should add that although we did not discuss it, one classical use of coho¬ 
mology and Hodge theory is to deduce topological information about an underlying 
topological space. Even over a field of characteristic zero, if we sample sufficiently 
many points V from a sufficiently nice metric space f2, and set G = ( V ., E ) to be an 
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appropriately chosen nearest neighbor graph, then 

j3 k(G) = dim H k {G) = dimker(Afc) 


gives the number of ‘fc-dimensional holes’ in SI. While the kernel or O-eigenspace 
captures qualitative topological information, the nonzero eigenspaces often capture 
quantitative geometric information. In the context of graphs [2Hf63], this is best 
seen in A 0 — its O-eigenpair tells us whether a graph is connected (/3o{G) gives the 
number of connected components of G, as we saw in Section 3.4) while its smallest 
nonzero eigenpair tells us how connected the graph is (eigenvalue by way of the 
Clieeger inequality and eigenvector by way of the Fiedler vector). 


5.2. More linear algebra. Since V is a finite set, L 2 (V), L A (E), L A (T) are 
finite-dimensional vector spaces. We may choose bases on these spaces and get 

L 2 (V)^W, L 2 A (E)^R n , L 2 A (T)^R m 

where p , n, m are respectively the number of vertices, edges, and triangles in G. 
Nevertheless, a more natural way would be to regard L 2 A {E) as an n-dimensional 
subspace of skew-symmetric matrices 

{ { x ij\ F i, 3 =i e K pxp : Xij = —Xji, Xij = 0 whenever {i,j} & E}, 

and L A (T) as an m-dimensional subspace of skew-symmetric hypermatrices 48 


{^ ^ ^ ' ^Pijk — ‘pjki — ^Pkij — Pjik — Pikj — Pkjii 

fijk = 0 whenever {i,j, k} ^ Tj. 

See also Section [C] for examples of how one may in practice write down matrices 
representing fc-coboundary operators and Hodge fc-Laplacians for k = 0,1, 2. 


5.3. Continuous analogue. The continuous analogue of Section [3] where the 
graph G is replaced by a region f1 in R n could be illuminating for readers who 
are familiar with partial differential equations and physics. This section requires 
nothing more than multivariate calculus. 

The Laplace or homogeneous Poisson equation is the PDE 

s 2 / a 2 / a 2 / 

dx 2 dy 2 dz 2 

over a domain fl C R 3 , or more generally, 


(5.1) 


n f)2 f 

- A 0 f = -V 2 / = -V • V/ = div grad / = Y ^-L = 0 


i -1 


over a domain H C R". 

The Laplace equation is ubiquitous in physical phenomena, we find it in electro¬ 
statics, describing electric potential in free space with no charge; in fluid mechanics, 
describing the velocity potential of an incompressible fluid; in thermal conduction, 
as the stationary heat equation without a heat source. Furthermore, it tells us a 
great deal about the topology and geometry of the domain H [44]. 

To ensure uniqueness of solution, one would usually study the Laplace equation 
with some boundary conditions, say, 


A of = 0 in H, 

/ = g on dfl 


or 


A 0 f = 0 in SI, 
df/dv = g on dSl. 
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O-cohomolgy may be viewed as the study of solutions to Laplace equation with 
no boundary conditions. A 0-cohomology class is a harmonic function, i.e., a scalar 
field / : f2 —► 1R that satisfies (5.1l. The Oth cohomology group is then the set of 
all harmonic functions or harmonic 0-forms, 

H°(Sl) = ker(A 0 ) = {/ : fi -i 


A 0 / = 0}. 


There is also the vector Laplace or homogeneous vector Poisson equation in 
R 3 , usually stated with some boundary conditions: 

{ — grad div / + curl curl / = 0 in Q, 
f ■ v = 0, curl / x v = 0 on dfl. 

If we drop the boundary conditions, we get 


(5.2) 


A if = V(V ■ /) — V x (V x /) = curl curl / — grad div / = 0, 


where Ai is the Helmholtz operator or vector Laplacian in R 3 . 

1-cohomolgy may be viewed as the study of solutions to the vector Laplace 
equation with no boundary conditions. A 1-cohomology class is a harmonic 1-form, 
i.e., a vector field / : Q —> R 3 that satisfies (5.21. The 1st cohomology group is 
then the set of all harmonic 1-forms, 


H\n) = ker(Ai) = {/ : Q -> R 3 : Ai/ = 0}. 


5.4. Computations. One particular nice feature of the Hodge decomposition 
is that it can be efficiently computed by solving least squares problems. For exam¬ 


ple, to compute the decomposition in (3.7) for any given X £ L^(E), we may solve 
the two least squares problems 


min II grad f — X \ 
feL 2 (v) 6 


and min || curl* $ — X||, 

t>£Ll(T) 


to get Xh as X — grad / — curl* $. Alternatively, we may solve 


min HAiY — A|| 

V'c/./(/•:) 

for the minimizer Y and get Xh as the residual X — AiY directly. Once we obtain 
AiF, we may use the decomposition ( |2.9| ), 

im(Ai) = im(grad) ® im(cuiT), 

AiY = grad/ + curl* $, 

and solve either 

min || grad f — AiY|| or min || curl* $ — AiY|| 

/ei 2 (v)" *€Ll(T)" 

to get the remaining two components. 

We have the choice of practical, efficient, and stable methods like Krylov sub¬ 
space methods for singular symmetric least squares problems [191120] or specialized 
methods for the Hodge 1-Laplacian with proven complexity bounds [23] . 
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6. Applications 

Applications of cohomology and Hodge theory are plentiful, we find them in 
numerical analysis [3], peridynamics [30], topological data analysis 126] . compu¬ 
tational topology | 33l . graphics [ 66] . image processing GS], robotics |45j . sensor 
networks |65j . neuroscience [53] . and many other areas in physical science and 
engineering. But these applications are not ‘surprising’ in the sense that they all 
concern physics, geometry, or topology — areas that gave birth to cohomology and 
Hodge theory in the first place. 

What we find somewhat unexpected are recent applications of cohomology and 
Hodge theory to game theory ]15j and ranking 143]. It is not at first obvious 
how cohomology and Hodge theory can play a role in these areas, which are less 
structured compared to applications arising from manifestation of physical laws (e.g. 
Maxwell’s equations or Navier-Stokes equations) naturally amenable to cohomology 
and Hodge theory. We will briefly describe these in the next two sections. 

The headings game theory and ranking provide basic frameworks that underlie 
other data analytic or information theoretic applications such as ad auction, social 
networks (for the former), web search, recommendation systems, crowd sourcing 
(for the latter). Other recent works along these lines may be found in [161 28, [40 j, 
[56l .69! . Cohomology and Hodge theory are more widely applicable than they seem 
at first. 


6.1. Rankings. The application to ranking in [43] involves nothing more than 
attaching ranking-theoretic meanings to the mathematical objects in Section [3] 
There is a very readable account of this in [4]. 

Suppose V is a set of alternatives to be ranked and EC (^) is the set of pairs 
of alternatives that have been compared. The degree of preference of an alternative 
i over another alternative j is then naturally quantified as X(«, j), since if X(i, j) is 
the amount one favors i over j, then it is natural to assume that X(j, i) = —X(i,j) 
is the amount one disfavors j over i. This is the only assumption required of the 
method. 

We call the resulting edge flow X £ L\{E) a pairwise ranking. If there is more 
than one voter, X may be computed by aggregating over all voters using a variety 
of rank aggregation methods that are standard in Statistics 


The Helmholtz 


decomposition ( 2.61 gives 

L\{E) = im(grad) ® ker(Ai) ® im(curT) 
X = grad / + Xh + curl* <I>, 


( 6 . 1 ) 


may then be interpreted as 

pairwise ranking = consistent + globally inconsistent + locally inconsistent. 


Inconsistency refers to situations where we have circular preference relations like 
* >- j k i. Consistency means an absence of inconsistencies. 

The consistent component Xp = — grad / yields a scor^] function / £ L 2 (V) 
that allows alternatives to be globally ranked: i y j iff f{i) > f(j). The total size of 
the two inconsistent components (measured by their norms) provides a ‘certificate 
of reliability’ for the global ranking obtained if the sum of the inconsistent 
components is large relative to the consistent component, then the global ranking 


7 A score function is a potential function, cf. in the context of ranking. 
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obtained is unreliable. This method of obtaining a global ranking of alternatives 
via the Helmholtz decomposition is called HodgeRank. 

The further splitting of the inconsistent component into local and global parts 
gives finer details about the nature of the inconsistencies. Roughly speaking, the 
local inconsistent component curl <f> measures inconsistencies among items ranked 
closed together whereas the global inconsistent component Xh measures inconsis¬ 
tencies among items ranked far apart. One may view the two inconsistent com¬ 
ponents as a quantification of various conundrums in voting theory: Condorcet 
paradox, Arrow’s impossibility, McKelvey chaos, etc 158 . 

In this context, other quantities that we discussed in Section [3] also have 
ranking-theoretic meanings. For example, assuming we use the standard Z 2 -inner 
products, the divergence of a pairwise ranking X £ L^(E) is a function divX £ 
L 2 (V) and for each alternative i £ V , 

(di vl)(i)=^. X{i,j) 

gives a generalization of the Borda count |25L 58] . In particular, divX(i) = 0 
implies that the alternative i is preference-neutral. 

Table [3] provides an illustration, where a variant of HodgeRank [351 is applied 
to the dataset from the 2006 Netflix Prize competition m to obtain a global 
ranking of the 17,770 movies]^ (the alternatives) based on 100,480,507 ratings given 
by 480,189 viewers (the voters). The middle and right columns show the top 15 
results obtained when HodgeRank is used in conjunction with the log-odds model 
and linear model for rank-aggregation. The left column shows the ranking obtained 
via simple averaging of the ratings; the shortcoming of such an approach is obvious 
— if Movie X gets one single 5-star rating and Movie Y gets 10,000 5-star ratings 
and one single 1-star rating, ranking by average ratings would put Movie X ahead of 
Movie Y. We call such ratings imbalanced. The results shown here already includes 
a mitigating factor whereby infrequently rated movies are filtered out beforehand 
but the imbalanced effect is still visible in the left column. One reason the effect is 
not as pronounced in HodgeRank is that it uses an inner product with 


Wij = Wji = number of voters who have compared alternatives i and j 


for the weights in (3.11 to correct for such imbalanced ratings. 


6.2. Games. Consider a game with n players, indexed 1,..., n. We will define 
a graph G = (R, E) as follows. The vertex set R is a finite set of the form 

V = Si x • • • x S n 

where S, is a finite set of all possible strategies for the ith player. An element 
(si,..., s n ) £ V is called a strategy profile where Sj £ Si denotes the strategy of the 
zth player. Once we have specified V, the edge set E C (])) is completely determined 
in the following way: for any pair of vertices (si,..., s n ), (ii,..., t n ) £ V, 

(6.2) {(si,..., s n ), (t i,..., t n )} £ E iff there exists exactly one i where s* ^ U. 

In other words, E comprises all pairs of strategy profiles that differ only in the 
strategy of a single player. Note that in this application E cannot be arbitrary but 
is completely determined by R. 


’LOTR III appears twice because of the two DVD editions, theatrical and extended. 
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Table 4. Top 15 movies in the Netflix dataset generated by a 
variant of HodgeRank (middle and right). For comparison, left 
column is a ranking via the mean ratings of the movies. (Repro¬ 
duced verbatim from [ 551 Table 2, p. 62]) 


MEAN RATINGS 

HodgeRank / log-odds 

HodgeRank/linear 

LOTR III: Return ... 

LOTR III: Return .. . 

LOTR III: Return ... 

LOTR I: The Fellowship ... 

LOTR I: The Fellowship ... 

LOTR I: The Fellowship ... 

LOTR II: The Two ... 

LOTR II: The Two ... 

LOTR II: The Two ... 

Lost: Season 1 

Star Wars V: Empire ... 

Lost: SI 

Battlestar Galactica: Si 

Raiders of the Lost Ark 

Star Wars V: Empire ... 

Fullmetal Alchemist 

Star Wars IV: A New Hope 

Battlestar Galactica: SI 

Trailer Park Boys: S4 

Shawshank Redemption 

Star Wars IV: A New Hope 

Trailer Park Boys: S3 

Star Wars VI: Return ... 

LOTR III: Return ... 

Tenchi Muyo! ... 

LOTR III: Return ... 

Raiders of the Lost Ark 

Shawshank Redemption 

The Godfather 

The Godfather 

Veronica Mars: SI 

Toy Story 

Shawshank Redemption 

Ghost in the Shell: S2 

Lost: SI 

Star Wars VI: Return . .. 

Arrested Development: S2 

Schindler’s List 

Gladiator 

Simpsons: S6 

Finding Nemo 

Simpsons: S5 

Inu-Yasha 

CSI: S4 

Schindler’s List 


Once we have specified V and therefore E, a game on the graph G = (V, E) is 
determined by the utility function ft £ L 2 (V) of the ith player, i = 1,... ,n. We 
write F := (/j, ..., /„). So F : V —» is a vector field on V and we will call it 

the utility vector field of the game. We denote the set of all vector fields by 

L 2 (V ; IT) := {F : V -+ R n : f u ...,/„ € L 2 (V)} * L 2 {V) © • • ■ ® L 2 (V). 

Since every game on G is specified by a utility vector field, L 2 (V ; R n ) parameterizes 
all games on G. 

A potential game is one where the utility vector field F = (/i,..., f n ) satisfies 

grad/i = • • • = grad f n . 

This condition implies that /,; and fj differ at most by an additive constanlj^] It 
may be interpreted to mean that the utilities of all players are aligned with a global 
objective. Such games were first proposed in [541 and are widely studied in game 
theory. They are easy to analyze and have pure Nash equilibria , i.e., strategy profiles 
(si,..., s n ) £ V such that 

fi(Sll • • • 7 7 ■ • • 5 &n) A fi(s 1, • • • 7 , S n ) 

for every s' £ Si and for each i = 1 ,,n. 

A harmonic game is one where the utility vector field F = (/ 1; ..., /„) satisfies 

Ao(/i + • • • + f n ) — 0, 

i.e., fi + • • • + f n £ ker(Ao) is a harmonic function. These were first defined in 
jl~5] based on similar earlier ideas }42j . Such games generically have no pure Nash 

®By the definition of K. G is connected and so grad / = 0 implies / is a constant function. 







22 


LEK-HENG LIM 


equilibrium and may be regarded essentially as sums of cycles [15] . A well-known 
simple example would be the rock-paper-scissors game. 

We define subspaces of utility vector fields corresponding to potential and har¬ 
monic games respectively: 

P = {F £ P 2 (E;M") : grad fl = grad fj for all i,j = 1,... , n}, 

H = {F £ L 2 (V; R") : A 0 (/ 1 + • ■ ■ -f f n ) = 0}, 

C = {F £ L 2 (V;R n ) : fi ,..., /„ constant functions}. 

As before, F = (/i,..., /„). C is the set of constant vector fields and it is easy to 
see that Pnlf = C. 

The Jacobian is Jac : L 2 (V;]R") —► T 2 (P;R n ) takes F £ P 2 (F;R n ) to 
JacF = (grad/i,..., grad /„), 

in L 2 a {E\ R”) := {(Xl, ..., X n ) : X 1: ..., X n £ L A (E)}. In the context of games, 
an edge flow in L 2 (P) is called a game flow and an element of L 2 A {E\ M") is a 
vector Held of game flows. The image of the spaces P, H, C under the Jacobian are 
respectively, 

Jac(P) = {(X,... ,X) £ L 2 (E;R n ) : X £ L 2 (E)j, 

Jac(P) = {(Xi,..., X n ) £ P 2 (P; R") : div(Xi + ■ • • + X n ) = 0}, 
and Jac(C) = {0}. 

In 115] , the reader would find a decomposition of games into potential, har¬ 
monic, and nonstrategic components, induced by the Helmholtz decomposition of 
game flows. The details are a bit too involved for this article and we would in¬ 
stead just describe the latter. The main distinction is that a decomposition of 
games is a decomposition of L 2 (H;R n ) whereas a decomposition of game flows is a 
decomposition of P 2 (E). 

A departure from the general Helmholtz decomposition is that game flows are 
always curl-free, i.e., always in ker(curl). So the Helmholtz decomposition of game 
flows takes the form 

ker(curl) = im(grad) © ker(Ai), 

X = X P + X H , 
with the following interpretation, 

game flow = potential game flow + harmonic game flow. 

Example 6.1 (Road sharing game). We discuss a beautiful three-player game 
with nonzero potential and harmonic components first given in (l5'j . In this game, 
three players C, R 1 P each chooses one of two roads a, b. 

The scenario is the immediate aftermath of a bank robbery where a policeman 
(player P) is in pursuit of the bank robber (player R). Naturally the policeman 
wants to be on the same road as the robber while the robber wants to avoid that. 
The commuter (player C ), an innocent bystander, does not want collateral damage 
by being on the same road as either the robber or the policeman. 

The respective utility functions, or rather, changes to the values of the utility 
functions (i.e., payoffs), are described as follows: 

• commuter’s payoff decreases by 2 with each other player sharing his road; 

• robber’s payoff is —1 if the policeman shares his road and 0 otherwise; 

• policeman’s payoff is exactly the negative of the robber’s payoff. 
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Here Sc = Sr = Sp = {a, b} and the set of strategy profiles is therefore 
V = {(sc, SR, sp) : SC £ Sc, SR £ Sr, Sp £ Sp} 

= {(a, a, a), (a, a, b ), (a, b, a), (a, b , b ), (b , a, a), ( 6 , a, b), (b, b, a), ( b , b, 6 )}. 


E is uniquely determined by V according to (6.2) and comprises pairs of strategy 


profiles that differ only in the strategy of a single player: 

{(a, a, a), (a, a, b)j, {(a, a, a), (a, b, a)}, {(a, a, a), (b, a, a)}, {(a,a,b), (a,b, b)j, 
{(a,a,b),(b,a,b)}, {(a,b,a), (a, b,b)j, {(a, b, a), (b, b, a)}, {(b,a, a), (b, a,b)}, 
{(b,a,a),(b,b,a)}, {(a, b, b), (b, b, b)}, {{b,a,b),(b,b,b)}, {{b,b,a),(b,b,b)}. 

The Helmholtz decomposition of this game is shown in Figure [ 8 ] An edge flow 
between two vertices (i.e., strategy profiles) is represented by an arrow and a weight 
the direction of the arrow represents an improvement in the payoff of the player 
who changes its strategy; the amount of improvement is given by the corresponding 
weight. 


(6, b , a) 


( b , a, a) 


(a, 6, a) 


(a, a, a) 


(6, b , a) ■ 


(6, a, a) 


(a, a, a) 


(6, a, b) 


(a, 6, a) 


(a) flow of road-sharing game 


(6, a, 6) 


(6,6, b) 


(a, 6,6) 


(6, a, a) 


(a, a, 6) 


(a, a, a) 


( 6 , 6 , 6 ) 


(a, 6,6) 


(a, a, 6) 


(6,6, a) 


(a, 6, a) 


(6, a, b) 


( 6 , 6 , 6 ) 


(a, 6,6) 


(a, a, 6) 


(b) potential component 


(c) harmonic component 


Figure 8 . Helmholtz decomposition of the road sharing game. 
(Reproduced with slight modification from [15( Figure 1, p. 475]) 


Observe that the game flow X £ L\{E) shown in Figure 8a is indeed curl- free 
since T = 0 and so L\{T) = {0}. The potential component Xp in Figure 8b 


flows out of sources (a, a, a) and ( b , b, b ) into sinks (a, b , b) and ( b, a, a). An explicit 
potential function / £ L 2 (V) is easy to construct, for example, 

/(a, a, a) = f(b, b, b ) = 1, /(a, b, b) = f(b, a, a) = -1, 

/(a, a, b) = f(a, b, a) = f(b, a, b) = f(b, b , a) = 0 . 
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Check that Xp = — grad /, as in (3.51. 

The harmonic component Xp 
cyckf^l that goes from 

(a, a, a) —)• (6, a, a) —> ( b , 6, a) - 


£ L\{E) shown in Figure 8c is evidently a 


( b , 6, 6) —► (a, 6,6) —> (a, a, 6) —» (a, a, a) 


in a loop. The divergence of Xp is easily seen to be zero by ( |3.4[ ) since the inflow 
equals the outflow at every vertex. Given that X and therefore Xp is in ker(curl), 
we conclude that Xp £ ker(curl) n ker(div) = ker(Ai) by (3.7), i.e., it is indeed a 
harmonic 1-cochain. 


7. Nonlinear Laplacians on a graph 


In this last section, we will introduce another generalization of the graph 
Laplacian that is also constructed out of the coboundary operators in Section |3.3| 
Whereas the Hodge Laplacians may be viewed as a higher order generalization 
of the graph Laplacian, acting linearly on functions of higher order cliques as op¬ 
posed to functions of vertices (i.e., 1-cliques), the generalization in this section takes 
things in a different direction — they act on functions of vertices but are nonlinear 
operators. 

The nonlinear p-Laplacians are defined on open subsets of K” [32l 49] by 
(7.1) L p f = -div(|| grad/f 2 grad/) 


for 1 < p < oo, and 

for p = l,oo respectively. Here grad/ = V/ is taken to be a column vector and 
II • II refers to the usual Euclidean norm on R ra , i.e., 


Loaf = —(grad /) T (grad || grad/|| 2 ) 


|grad/|| 2 =^ 


V dx;) 


These are all nonlinear operators with the exception of the case p = 2, which is 
a linear operator. The nonlinear 1-Laplacian is sometimes also called the mean 
curvature operator. 

For 1 < p < oo, the nonlinear p-Laplacians on a graph G = (V, E) are defined 
by the same formulas in (7.1) and (7.2) except that the gradient and divergence 
operators div and grad should now be taken to be the coboundary operators in- 

In particular, the (non)linear 2-Laplacian is exactly the 


traduced in Section 3.3 


Hodge O-Laplacian defined in Section |4.3[ 

L 2 = — div grad = Aq, 


which is of course the usual graph Laplacian. Note that as before, our graphs are 
undirected and have no loops or multiple edges 

For 1 < p < 2, L p has a singularity whenever grad / vanishes. To define it 
everywhere we borrow some ideas from nondifferentiable optimization i|22j. For 
p = 1, the nonlinear 1-Laplacian [38] on a graph is given by 

Lif = - div(sgn(grad /)), 


■^Or more accurately, a cocycle, since we are talking about functions, 
form a 1-cycle but functions on them form 1-cocycles. See Tables ^ and [ 3 ] 


The directed edges 
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essentially obtained from (7.2) by replacing x/ 
is a set-valued function defined on R by 


|x|| with sgn(x). 


The function sgn 


f —1 if x < 0 , 
sgn(s) = < [— 1 , 1 ] if x = 0 , 
[+1 if x > 0 , 

and extended to R n coordinatewise, i.e., 

sgn(s) = (sgn(xi),..., sgn(a; n )) 


for x = (xi,... ,x n ) £ R n . This is the subdifferential of the 1-norm on R”, i.e., 
<9||x||i = sgn(x). 

An example of an instance where the nonlinear 1-Laplacian appears is the 
following. In spectral graph theory, a well-known fundamental result is that the 
isoperimetric or Cheeger constant, 

l(r)= . \E(S,V\S)\ 

{ J scv min(Vol(5), Vol(V \ S)) ’ 
of a connected graph G satisfies II the Cheeger inequality 

^A 2 (L 2 ) < h{G) < y/2X 2 (L 2 ), 

where A 2 (L 2 ) is the second eigenvalue of the usual graph Laplacian. But what if 
we want equality? The answer, as demonstrated in [38IH81. is that if we replace 
1*2 by Li, then we get 

A 2 (L 1 ) = h(G). 

We will end this section with a few words about applications. The continuous 
nonlinear 1-Laplacian, usually under its better-know name ‘mean curvature oper¬ 
ator,’ plays a central role in the Level Set Method |55j . In a similar spirit, the 
discrete version has also been found [461 to have a game-theoretic interpretation 
the so-called balancing games in R n [62) may be regarded as a discrete motion 
by mean curvature. A simple example of a balancing game is as follows: Player I 
chooses a direction, i.e., a unit vector Xi £ R", and Player II chooses whether to 
flip it, i.e., £i £ {— 1 , + 1 }, in a way so that ||]P* =1 s i x i\\ is minimized at step t. 


8. Conclusion 

Traditional applied mathematics largely involves using partial differential equa¬ 
tions to model physical phenomena and traditional computational mathematics 
largely revolves around numerical solutions of PDEs. 

However, one usually needs substantial and rather precise knowledge about a 
phenomenon in order to write it down as PDEs. For example, one may need to 
know the dynamical laws (e.g. laws of motions, principle of least action, laws of 
thermodynamics, quantum mechanical postulates, etc) or conservation laws (e.g. 
of energy, momentum, mass, charge, etc) underlying the phenomenon before being 
able to ‘write down’ the corresponding PDEs (as equations of motion, of continuity 
and transfer, constitutive equations, field equations, etc). In traditional applied 
mathematics, it is often taken for granted that there are known physical laws behind 
the phenomena being modeled. 

In modern data applications, this is often a luxury. For example, if we want to 
build a spam filter, then it is conceivable that we would want to understand the ‘laws 
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of emails.’ But we would quickly come to the realization that these ‘laws of emails’ 
would be too numerous to enumerate and too inexact to be formulated precisely, 
even if we restrict ourselves to those relevant for identifying spam. This is invariably 
the case for any human generated data: movie ratings, restaurant reviews, browsing 
behavior, clickthrough rates, newsfeeds, tweets, blogs, instagrams, status updates 
on various social media, etc, but the perhaps surprising thing is that it also applies 
to many data sets from modern biology and medicine. 

For such data sets, all one has is often a rough measure of how similar two 
data points are and how the dataset is distributed. Topology can be a useful tool 
in such contexts m since it requires very little — essentially just a weak notion 
of separation, i.e., is there a non-trivial open set that contains those points? 

If the data set is discrete and finite, which is almost always the case in appli¬ 
cations, we can even limit ourselves to simplicial topology, where the topological 


spaces are simplicial complexes (cf. Section 3.1). Without too much loss of gener¬ 


ality, these may be regarded as clique complexes of graphs (cf. Section 5.1): data 


points are vertices in V and proximity is characterized by cliques: a pair of data 
points are near each other iff they form an edge in E, a triplet of data points are 
near one another iff they form a triangle in T, and so on. 

In this article, we have undertaken the point of view that 


graphs are discrete Riemannian manifolds 


and that cohomology is a much-relaxed discrete analogue of PDEs (cf. Sections 3.3 


5.3). Furthermore, standard partial differential operators on Riemannian manifolds 
- gradient, divergence, curl, Jacobian, Hessian, Laplace and Helmholtz operators, 
Hodge and nonlinear Laplacians — all have natural counterparts on graphs (cf. 
Sections |3j [4j [7| with useful roles in data applications (cf. Section |6j) . An example 
of a line of work that carries this point of view to great fruition is 
Also, in this article we have only scratched the surface of cohomological and Hodge 
theoretic techniques in graph theory; see m for results that go much further. 

In traditional computational mathematics, discrete PDEs arise as discretiza¬ 
tion of continuous PDEs, intermediate by-products of numerical schemes. But in 
data analytic applications, discrete PDEs tend to play a more central and direct 
role. Despite these differences, the discrete partial differential operators on graphs 
introduced in this article may perhaps serve as a bridge on which insights from tra¬ 
ditional applied and computational mathematics could cross over and be brought 
to bear on modern data analytic applications. 
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Appendix A. Numerical Linear Algebra Refresher 

By numerical linear algebra , we mean linear algebra over any subfield of C. We 
will provide routine proofs for some numerical linear algebraic facts that we have 
used freely in Section [2j We will work over R for convenience. 

Theorem A.l. Let A £ R mxn . Then 
® ker(A*A) = ker(A), 

© im (A* A) = im (A*), 

© ker(A*) = im(A)- L , 

@ im(A*) = ker(A)- 1 -, 

© R” = kcr(A) © im(A*). 

Proof. 

© Clearly ker(A) C ker(A*A). If A*Ax = 0, then ||Ax’|| 2 = x*A*Ax = 0, so 
Ax = 0, and so ker(A*A) C ker(A). 

@ Applying rank-nullity theorem twice with ©, we get 

rank(A*A) = n — nullity (A* A) 

= n — nullity (A) = rank(A) = rank(A*). 

Since im(A*A) C im(A*), the result follows. 

® If x £ im(A)- L , then 0 = (x, Ay) = (A*x , y ) for all y £ R n , so A*x = 0. If 
x £ ker(A*), then ( x,Ay) = ( A*x,y ) = 0 for all y £ R n , so x € im(A)- 1 -. 

© By ®, im(A*)- L = ker(A**) = ker(A) and result follows. 

© R n = ker(A) ® ker(A)- 1 - = ker(A) © im(A*) by ©. □ 

Our next proof ought to convince readers that the Hodge decomposition theo¬ 
rem is indeed an extension of the Fredholm alternative theorem to a pair of matrices. 
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Theorem A.2. Let A £ R mxn and B £ R nxp with AB = 0. Then 
© ker (A*A + BB*) = ker(A) n ker(B*), 

® ker(A) = im(i?) ® ker {A* A + BB*), 

® ker (B*) = im [A*) ® ker(A*A + BB*), 

© R n = im(A*) ® ker {A* A + BB*) ® im(B), 

® im (A* A + BB*) = im(A*) ® im (B). 


Proof. Note that im(i?) C ker(A) as AB = 0, im(A*) C ker(£?*) as B*A* = 0. 
© Clearly ker (A) n ker(H*) C ker (A* A + BB*). Let x £ ker {A* A + BB*). 
Then A*Ax = —BB*x. 

• Multiplying by A, we get AA* Ax = — ABB*x = 0 since AB = 0. So 
A*Ax £ ker(A). But A*Ax £ im(A*) = ker(A)- 1 - by ©. So A*Ax = 0 
and x £ ker(A*A) = ker(A) by ©. 

• Multiplying by B*, we get 0 = B* A*Ax = -B*BB*x since B*A* = 
0. So BB*x £ ker(B*). But BB*x £ im(i?) = ker(S*)- L by ©. So 
BB* x = 0 and x £ ker (BB*) = ker(B*) by ©. 

Hence x £ ker(H) D ker(H*). 

® Applying ©to B*, 

ker(A) = K" D ker(A) = [ker(H*) ® im(H)] D ker(A) 

= [ker(H*) fl ker(A)] ® [im(5) fl ker(A)] 

= ker (A* A + BB*) © im(H), 

where the last equality follows from © and irn(H) C ker(A). 

© Applying ©, 

ker(H*) = K" n ker(B*) = [ker(A) © im(A*)] fl ker(B*) 

= [ker(A) fl ker(H*)] © [im(A*) fl ker(H*)] 

= ker(A* A + BB*) © im(A*), 

where the last equality follows from © and im(A*) C ker(H*). Alterna¬ 
tively, apply ® with B*, A* in place of A, B. 

© Applying © to B* followed by ®, we get 

R n = ker(H*) © im(H) = im(A*) © ker(A*A + BB*) © im(H). 

® Applying © to A* A + BB* , which is self-adjoint, we see that 

im(A*A + BB*) = ker(A*A + BB*) ± = im(A*) © im(H), 

where the last equality follows from @. □ 


Any two vector spaces of the same dimension are isomorphic. So saying that 
two vector spaces are isomorphic isn’t saying very much — just that they have the 
same dimension. The two spaces in (2.12) are special because they are naturally 
isomorphic , i.e., if you construct an isomorphism, and the guy in the office next door 
constructs an isomorphism, both of you would end up with the same isomorphism, 
namely, the one below. 


Theorem A. 3. Let A £ and B £ K nxp with AB = 0. Then the following 

spaces are naturally isomorphic 

ker (A)/im (B) = ker(A) fl ker(B*) = ker(i3*)/im(A*). 
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PROOF. Let 7 r : R n —► in^B) 1 - be the orthogonal projection of R n onto the 
orthogonal complement of im(B). So any x £ R" has a unique decomposition into 
two mutually orthogonal components 

R n = irn (B) 1 - © im(B), 
x = tt(x) +(1 — 7 r)(a;). 

Let 7^4 be 7 r restricted to the subspace ker(A). So any x £ ker(A) has a unique 
decomposition into two mutually orthogonal components 

ker(A) = (ker(.A) nin^B)- 1 -) © im(B), 
x = it a {x) + (1 - ir A )(x), 

bearing in mind that ker(A) n irn(B) = im(B) since im(B) C ker(A). 

As 7 r is surjective, so is tt A - Hence 1111 ( 71 ^ 4 ) = ker(A) D im(i?)- L . Also, for any 
x £ ker(A), n A (x) = 0 iff the component of x in in^B)- 1 - is zero, i.e., x £ im(B). 
Hence ker( 7 T/t) = irn(B). The first isomorphsim theorem, 

ker(A)/ker( 7 r J 4 ) = im(7TA) = ker(A) n 

yields the required result since im(B)- 1 - = ker(B*) by ®. The other isomorphism 
may be obtained as usual by using B*,A* in place of A, B. □ 


In mathematics, linear algebra usually refers to a collection of facts that follow 
from the defining axioms of a field and of a vector space. In this regard, every single 
statement in Theorems [ATj[01[A]| is false as a statement in linear algebra — 
they depend specifically on our working over a subfield of C and are not true over 
arbitrary fields. For example, consider the finite field of two elements F 2 = {0,1} 
and take 


A = B = 


1 

1 


1 

1 


Then A* = A = B = B*, and AB = B* A* = A* A = BB* = A* A + BB* = 0, 
which serves as a counterexample to ©, ©, CD, ®, ®, (D, ®, and Theorem A.3 


Appendix B. Div, Grad, Curl, and All That 

We provide routine verifications of statements claimed in Sections [3] and [4] 

Lemma B.l. Equip L 2 (V) and L\(E) with the inner products in ( |3.1[ ), we have 

n 

grad* X(i) = - ^ —X(i,j) = -divX(i). 

1 Wi 
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PROOF. The required expression follows from 
(grad* X, f) v = ( X , grad f) E 

= 'Y].^.WijX{i,j)gradf(i,j) 

= E ^ • w io x {^j)[fU) - /(*)] 


V 

' i<3 

w ijX(i,j)f(j)+y2 Wi 

z — 

? X C/ 

0/(0 

V 

WjiX{j,i)f(i) w h 

l< ~3 

x(j, 

0/(0 

V 

tr u X(j.i)f(i) • V <r 0 

z —'*<J 

x (i, 

0/(0 

V 

WijX(j,i)f{i ) 




£-^i= 1 

»'/E -j- — X U’ i ) 

/(* 

) 


v ■> ri 

^i=l 

Wi[Y2. ^XUJ)] 

L-*-— Wi J 

/(0 




grad* X(i) 


® follows from swapping labels i and j in the first summand. 

© follows from Wij = Wji. 

® follows from X(i,i) = 0. □ 


Lemma B.2. Equip If^(E) and L^(T) with the inner products in (3.1), we have 


curl* = 'S2 lU ^$(i,j,k). 

in - ■ 


k=l 


PROOF. The required expression follows from 
(curl* $,X) E = ($,curlX) T = V' w ijk ${i,j,k)cmlX(i,j,k) 

z — J i<j<k 

= '52 i<j<k Wi i k ®( i >j> k )[X(i,j) + X(j, k)+X(k,i)] 

= E-^t k)X(i,j) + V w ijk $(i,j, k)X(j, k ) 

z — J i<j<k z — J i<j<k 

+ V . Wij k $(i,j,k)X(k,i) 

L ' i< 3 <k 

= ^2 i<j<k 'Wijk^(i,j,k)X(i,j) + ^2 i< _ <k w ijk ${j,k,i)X(j,k) 


+ E- , w ijk$(k,i, j)X{k,i) 

z —' i<J<k 

= E ^ _ w ijk <&(i,j, k)X(i,j) + y\ . w kij $(i,j, k)X(i,j) 

z — *i<j<k z — *k<i<j 

+ y2^.^. w ikj®U>hk)X(j,i) 

z ' i<k<] 

= 'Y\^.^™iik§(i,j,k)X(i,j) + 'y\ w kij ${i,j,k)X(i,j) 

z ' i<j<k z — *k<i<j 

+ ^ 2 i<k<J w ikj®{i,j, k)X(i, j) 


E f(E" +E'~ 1 +E 

^<jLV^fc=i+i *-^k =l 


'k=i +1 


Wijk®{i,3,k) X(i,j) 
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E*,-^ 


^ijk 
k^i,j Wij 


curl* <f>(i,j) 

® follows from the alternating property of <f>. 

© follows from relabeling j, k, i as i, j, k in the second summand and swap¬ 
ping labels j and k in the third summand. 

© follows from <fr(j,i,k)X(j,i) = $(i,j,k)X(i,j ) since both changed signs. 
© follows from i) = = 0. □ 

Lemma B.3. The operator Ao = — divgrad gives us the usual graph Laplacian. 
Proof. Let / £ L 2 {V). By definition, 

grad f(,,j) = U (i) - m " £E ’ 

I 0 otherwise. 

Define the adjacency matrix A G M nxn by 

a _ f 1 if {i,j}£E, 

[0 otherwise. 

The gradient may be written as grad = ciij(f(j) — /(*)) and so 

(Ao/)(*) = -[div(grad/)](z) = — [diva.y(/(j) - /(*))](*) 


(B.l) 


= ~E = i “y [/(•?) _ /(*)] = d */w - E 


^— j j =i 

where for any vertex z = 1,..., n, we define its degree as 




E ti 

. , “if- 
1=1 

If we regard a function / £ L 2 (V) as a vector (/i,..., /„) £ K n where /(z) = ft 
and set D = diag(4,..., d n ) £ R” xn , then ( |B.1[ ) becomes 

-ai2 • • • —ai n /i 

_ — 022 • • • 02n fl 

Ao/ = ... ={D-A)f. 


d\ — an 

— «12 

®ln 

021 

4 — 022 

_ &2n 

&nl 

~O n 2 




'fi 


h 


jn_ 


So Ao may be regarded as D — A, the usual definition of a graph Laplacian. 

Theorem B.4. We have that 

curl grad = 0, divcurl* = 0, 
and more generally, for k = 1,2,, 

44-i = 0, 4-r^fc = o. 


□ 
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PROOF. We only need to check <4<4-i = 0. The other relations follow from 
taking adjoint or specializing to k = 1. Let / £ By (4.21 and (4.31, 


(<4<4-i/)(*o, ■ ■ ■ >4+i) = 2_s {-iy6 k -if(io, ■ ■ ■ ,ij, ■ ■ ■;4+i) 

= 55,= 0 (- 1 ) 0 [55^ = o• • • > 4, • • •, ij, • • •, 4+i) 

+ 5 5 ^ ( ~ 1) /(* 0) • • • > 4) • • ■ j 4j • ■ ■ j 4+l) 

= 51 ■ .i -1 )' 5 ( -1 )V(*o, • • ■, 4, • • ■, 4, • ■ •, 4+i) 

+ 55 • ,4, • • •, 4> ■ • •, 4+i) 

z — 

= 55 i< / _1 ) J+ ^(*0’ • ■ • ’4’ • ■ ■ >4: ■ • • , 4+l) 

+ 55 (-l) J+ ^7(4, • • • Jj, - • • ,4, • • •, 4+i) 

= 55 i< 5 _1 )' , ' M /(4>, • • ■. 4. • ■ ■. 4, ■ • •, 4+i) 

- 55 ■ ,( -1 ) J+ V(*0) • • •) 4> • • • 14, • • •, 4+i) = o. 

z —'j<£ 


The power of —1 in the third sum in © is £ — 1 because an argument preceding in 
is omitted and so in is the (l — l)th argument (which is also omitted). (D follows 
from swapping labels j and l in the second sum. □ 


Appendix C. Calculations 

We will work out the details of Example |4.1| While we have defined coboundary 
operators and Hodge Laplacians as abstract, coordinate-free linear operators, any 
actual applications would invariably involve ‘writing them down as matrices’ to 
facilitate calculations. Readers might perhaps find our concrete approach here 
instructive. 

A simple recipe for writing down a matrix representing a coboundary operator 
and a Hodge Laplacian is as follows: Given an undirected graph, label its vertices 
and edges arbitrarily but differently for easy distinction (e.g. we used numbers for 
vertices and letters for edges) and assign arbitrary directions to the edges. From 
the graphs in Figure [6| we get the labeled directed graphs G\ (left) and Gi (right) 
in Figure [9] 



Figure 9. The graphs in Figure |6j with vertices and edges arbi¬ 
trarily labeled and directions on edges arbitrarily assigned. 
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The next step is to write down a matrix whose columns are indexed by the 
vertices and the rows are indexed by the edges and whose (i, j)tli entry is +1 if jth 
edge points into the ?’th vertex, —1 if jth edge points out of the ith vertex, and 0 
otherwise. This matrix represents the gradient operator <5 q = grad. We get 



i 

2 

3 

4 

5 

6 



i 

2 

3 

4 

5 

6 

a 

' -l 

1 

0 

0 

0 

0 ' 


a 

' -l 

1 

0 

0 

0 

0 

b 

0 

-1 

1 

0 

0 

0 


b 

0 

-1 

1 

0 

0 

0 

c 

0 

0 

-1 

1 

0 

0 


c 

0 

0 

-1 

1 

0 

0 

ii 

a- 

i 

0 

0 

-1 

0 

0 

II 

<N 

d 

i 

0 

0 

-1 

0 

0 

e 

0 

0 

-1 

0 

1 

0 


e 

0 

0 

-1 

0 

1 

0 

/ 

0 

0 

0 

0 

-1 

1 


f 

0 

0 

0 

-1 

0 

1 

9 

0 

0 

-1 

0 

0 

1 


9 

0 

1 

0 

0 

0 

-1 


for G\ and G 2 respectively. Note that every row must contain exactly one +1 and 
one —1 since every edge is defined by a pair of vertices. This matrix is also known as 
a vertex-edge incidence matrix of the graph. Our choice of ±1 for in/out-pointing 
edges is also arbitrary — the opposite choice works just as well as long as we are 
consistent throughout. 

The graph Laplacians may either be computed from our definition as 


Lx = A\A! 


l 2 = a* 2 a 2 



i 

2 

3 

4 

5 

6 

1 

2 

-1 

0 

-1 

0 

0 

2 

-1 

2 

-1 

0 

0 

0 

3 

0 

-1 

4 

-1 

-1 

-1 

4 

-1 

0 

-1 

2 

0 

0 

5 

0 

0 

-1 

0 

2 

-1 

6 

0 

0 

-1 

0 

-1 

2 


i 

2 

3 

4 

5 

6 

1 

2 

-1 

0 

-1 

0 

0 

2 

-1 

3 

-1 

0 

0 

-1 

3 

0 

-1 

3 

-1 

-1 

0 

4 

-1 

0 

-1 

3 

0 

-1 

5 

0 

0 

-1 

0 

1 

0 

6 

0 

-1 

0 

-1 

0 

2 


or written down directly using the usual definition [2TU63] . 


! deg [vi) if i = j, 

— 1 if Vi is adjacent to Vj, 

0 otherwise. 


We obtain the same Laplacian matrix irrespective of the choice of directions on 
edges and the choice of ±1 for in/out-pointing edges. For us there is no avoiding 
the gradient operators since we need them for the graph Helmholtzian below. 

We may now find the eigenvalues of L\ and L 2 and see that they are indeed 
the values we claimed in Example |4.1| 

\{L X ) = (0, 3 - x/5, 2, 3, 3, 3 + VE) = \{L 2 ). 
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To write down the graph Helmholtzians, we first observe that G\ has exactly 
one triangle (i.e., 2-clique) whereas G 2 has no nip] We will need to label and pick 
an arbitrary orientation for the triangle in Gi: We denote it as T and orient it 
clockwise 3 —>■ 5 —» 6 —» 3. A matrix representing the operator <5i = curl may 
be similarly written down by indexing the columns with edges and the rows with 
triangles. Here we make the arbitrary choice that if the jth edge points in the same 
direction as the orientation of the ith triangle, then the (*,j)th entry is +1 and if 
it points in the opposite direction, then the entry is —1. For Gi we get 


a b c d d e f 

B± = t [ 0 0 0 0 1 1 -1 ]. 

Since G 2 contains no triangles, H 2 = 0 by definition. 

We compute the graph Helmholtzians from definition, 


Hi = A\A\ + B*B! 


H 2 = A 2 A* + B*B 2 



a 

b 

c 

d 

d 

e 

/ 

a 

2 

-1 

0 

-l 

0 

0 

0 

b 

-1 

2 

-1 

0 

-i 

0 

-1 

c 

0 

-1 

2 

-l 

i 

0 

1 

d 

-1 

0 

-1 

2 

0 

0 

0 

e 

0 

-1 

1 

0 

3 

0 

0 

f 

0 

0 

0 

0 

0 

3 

0 

9 

0 

-1 

1 

0 

0 

0 

3 


a 

b 

c 

d 

d 

e 

/ 

a 

2 

-1 

0 

-l 

0 

0 

1 

b 

-1 

2 

-1 

0 

-1 

0 

-1 

c 

0 

-1 

2 

-l 

1 

-1 

0 

d 

-1 

0 

-1 

2 

0 

1 

0 

e 

0 

-1 

1 

0 

2 

0 

0 

f 

0 

0 

-1 

1 

0 

2 

-1 

9 

1 

-1 

0 

0 

0 

-1 

2 


and verify that they have different spectra, as we had claimed in Example |4.1[ 

A (Hi) = (0, 3-\/5, 2, 3, 3, 3, 3 + V5) ^ (0, 0, 3^^, 2, 3, 3, 3 + ^) = A (H 2 ). 


We now repeat the routine and convert the undirected graphs in Figure [7] 
into labeled directed graphs G 3 (left) and G 4 (right) in Figure 10 We label both 



Figure 10. Labeled directed versions of the graphs in Figure [7J 


1 1 Those who see two triangles should note that these are really squares, or CTs to be accurate. 
See also Example 3.1 
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triangles in G 3 and G 4 as T and orient it clockwise 2 —► 1 —► 3 —> 2, giving us a 
matrix that represents both curl operators on G 3 and G4, 

a b c d e f g 

B 3 = B 4 = T [ 1 1 -1 0 0 0 0 ]. 

With these choices, we obtain the following matrix representations of the gra- 


dients, Laplacians, and Helmholtzians on G 3 

and G 4 , 






1 

2 

3 

4 

5 

6 7 



a 

1 

-1 

0 

0 

0 

0 0 ' 



b 

-1 

0 

1 

0 

0 

0 0 



c 

0 

-1 

1 

0 

0 

0 0 


^3 = 

d 

0 

0 

-1 

1 

0 

0 0 



e 

0 

0 

0 

-1 

1 

0 0 



f 

0 

0 

0 

-1 

0 

1 0 



9 

0 

0 

0 

-1 

0 

0 1 




1 

2 

3 

4 

5 

6 7 



a 

1 

-1 

0 

0 

0 

0 0 ' 



b 

-1 

0 

1 

0 

0 

0 0 



c 

0 

-1 

1 

0 

0 

0 0 


II 

d 

0 

0 

-1 

1 

0 

0 0 

) 


e 

0 

0 

0 

-1 

1 

0 0 



f 

0 

0 

-1 

0 

0 

1 0 



9 

0 

0 

0 

-1 

0 

0 1 




1 

2 

3 

4 

5 

6 

7 


1 

2 

-1 

-1 

0 

0 

0 

0 

1 


2 

-1 

2 

-1 

0 

0 

0 

0 


3 

-1 

-1 

3 

-1 

0 

0 

0 

L-,i — A ^ A 3 — 

4 

0 

0 

-1 

4 

-1 

-1 

-1 


5 

0 

0 

0 

-1 

1 

0 

0 


6 

0 

0 

0 

-1 

0 

1 

0 


7 

0 

0 

0 

-1 

0 

0 

1 

J 



1 

2 

3 

4 

5 

6 

7 


1 

2 

-1 

-1 

0 

0 

0 

0 

1 


2 

-1 

2 

-1 

0 

0 

0 

0 


3 

-1 

-1 

4 

-1 

0 

-1 

0 

L 4 = A* 4 A 4 = 

4 

0 

0 

-1 

3 

-1 

0 

-1 


5 

0 

0 

0 

-1 

1 

0 

0 


6 

0 

0 

-1 

0 

0 

1 

0 


7 

0 

0 

0 

-1 

0 

0 

1 

J 



a 

b 

c 

d 

e 

/ 

9 


a 

' 3 

0 

0 

0 

0 

0 1 

0 



b 

0 

3 

0 

-1 

0 

0 1 

0 



c 

0 

0 

3 

-1 

0 

0 1 

0 


h 3 = + b;b 3 = 

d 

0 

-1 

-1 

2 

-1 

-1 - 

-1 

:> 


e 

0 

0 

0 

-1 

2 

1 

1 



f 

0 

0 

0 

-1 

1 

2 

1 



9 

0 

0 

0 

-1 

1 

1 : 

2 
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a 

b 

c 

d 

e 

/ 

9 

a 

' 3 

0 

0 

0 

0 

0 

0 


b 

0 

3 

0 

-1 

0 

-1 

0 


c 

0 

0 

3 

-1 

0 

-1 

0 


h 4 = A 4 A* 4 + B* 4 B 4 = d 

0 

-1 

-1 

2 

-1 

1 

-1 


e 

0 

0 

0 

-1 

2 

0 

1 


f 

0 

-1 

-1 

1 

0 

2 

0 


g 

0 

0 

0 

-1 

1 

0 

2 


As we intend to show that G 3 

and G 4 have isospectral Hodge Ai-Laplacians for 


all k. we will also need to examine the Hodge 2-Laplacian A 2 . Since G 3 and G 4 
have no cliques of order higher than two, 5k = 0 for all k > 2 and in particular 
A 2 = 5±5l. So the lxl matrix representing A 2 is just 

P 3 = B 3 B* 3 = [3] = B 4 B* 4 =: P 4 

for both G 3 and G 4 . 

Finally, we verify that the spectra of the Hodge fc-Laplacians of G 3 and G 4 are 
identical for k = 0,1, 2, as we had claimed in Example |4.1| 

\(L 3 ) = (0, 0.40, 1, 1, 3, 3.34, 5.26) = A(L 4 ), 

A (H 3 ) = (0.40, 1, 1, 3, 3, 3.40, 5.26) = A (H 4 ), 

A (P 3 ) = 3 = A(P 4 ). 

Computational and Applied Mathematics Initiative, Department of Statistics, Uni¬ 
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